1 CAACCATTGC AGATCAGTGT GTGAGGGAAC TGCCATCATG AGGTCTGACA 
51 AGTCAGCTTT GGTATTTCTG CTCCTGCAGC TCTTCTGTGT TGGCTGTGGA 
101 TTCTGTGGGA AAGTCCTGGT GTGGCCCTGT GACATGAGCC ATTGGCTTAA 
151 TGTCAAGGTC ATTCTAGAAG AGCTCATAGT GAGAGGCCAT GAGGTAACAG 
201 TATTGACTCA CTCAAAGCCT TCGTTAATTG ACTACAGGAA GCCTTCTGCA 
251 TTGAAATTTG AGGTGGTCCA TATGCCACAG GACAGAACAG AAGAAAATGA 
301 AATATTTGTT GACCTAGCTC TGAATGTCTT GCCAGGCTTA TCAACCTGGC 
351 AATCAGTTAT AAAATTAAAT GATTTTTTTG TTGAAATAAG AGGAACTTTA 
401 AAAATGATGT GTGAGAGCTT TATCTACAAT CAGACGCTTA TGAAGAAGCT 
451 ACAGGAAACC AACTACGATG TAATGCTTAT AGACCCTGTG ATTCCCTGTG 
501 GAGACCTGAT GGCTGAGTTG CTTGCAGTCC CTTTTGTGCT CACACTTAGA 
551 ATTTCTGTAG GAGGCAATAT GGAGCGAAGC TGTGGGAAAC TTCCAGCTCC 
601 ACTTTCCTAT GTACCTGTGC CTATGACAGG ACTAACAGAC AGAATGACCT 
651 TTCTGGAAAG AGTAAAAAAT TCAATGCTTT CAGTTTTGTT CCACTTCTGG 
701 ATTCAGGATT ACGACTATCA TTTTTGGGAA GAGTTTTATA GTAAGGCATT 
751 AGGAAGGCCC ACTACATTAT GTGAGACTGT GGGAAAAGCT GAGATATGGC 
801 TAATACGAAC ATATTGGGAT TTTGAATTTC CTCAACCATA CCAACCTAAC 
851 TTTGAGTTTG TTGGAGGATT GCACTGTAAA CCTGCCAAAG CTTTGCCTAA 
901 GGAAATGGAA AATTTTGTCC AGAGTTCAGG GGAAGATGGT ATTGTGGTGT 
951 TTTCTCTGGG GTCACTGTTT CAAAATGTTA CAGAAGAAAA GGCTAATATC 
1001 ATTGCTTCAG CCCTTGCCCA GATCCCACAG AAGGTGTTAT GGAGGTACAA 
1051 AGGAAAAAAA CCATCCACAT TAGGAGCCAA TACTCGGCTG TATGATTGGA 
1101 TACCCCAGAA TGATCTTCTT GGTCATCCCA AAACCAAAGC TTTTATCACT 
1151 CATGGTGGAA TGAATGGGAT CTATGAAGCT ATTTACCATG GGGTCCCTAT 
1201 GGTGGGAGTT CCCATATTTG GTGATCAGCT TGATAACATA GCTCACATGA 
1251 AGGCCAAAGG AGCAGCTGTA GAAATAAACT TCAAAACTAT GACAAGCGAA 
1301 GATTTACTGA GGGCTTTGAG AACAGTCATT ACCGATTCCT CTTATAAAGA 
1351 GAATGCTATG AGATTATCAA GAATTCACCA TGATCAACCT GTAAAGCCCC 
1401 TAGATCGAGC AGTCTTCTGG ATCGAGTTTG TCATGCGCCA CAAAGGAGCC 
1451 AAGCACCTGC GATCAGCTGC CCATGACCTC ACCTGGTTCC AGCACTACTC 
1501 TATAGATGTG ATTGGGTTCC TGCTGACCTG TGTGGCAACT GCTATATTCT 
1551 TGTTCACAAA ATGTTTTTTA TTTTCCTGTC AAAAATTTAA TAAAACTAGA 
1601 AAGATAGAAA AGAGGGAATA GATCTTTCCA AATTCAAGAA AGACCTGATG 
1651 GGGTAATCCT GTTAATTCCA GCCACATAGA ATTTGGTGAA AACCTTGCTA 
1701 TTTTCATATT ATCTATTCTG TTATTTTATC TTAGCTATAT AGCCTAGAAT 
1751 TCCATGATCA TGAGGTTGTG AGTATATCTC ATTCTTTCGT TGCATTTTCC 
1801 TAGGTGTGCT TACTCTCTTC TCTCACTTTG TGACACAAGG ACATGAATAC 
1851 ATCTAAATTT TCCTATTTCT GATATCACTG TTTCCATGAC GTCATTACTT 
1901 CTCTAACCTT AAGTGATAGG GTGACCTGCA ATATGCTGAT TCCTGGTGTT 
1951 TGCACAAACA CATGGATGTA AAGAAGTAAA AAATGTAAAA TTCACAAAAT 
2001 TCAGTAAACC ACACAAATCA ATGAAGCATT CTATGACATT AGCTTGTTAT 
2051 GAGTAACATA ATGATTTTTC TTTTTCAATT TAAATAAGCC CTTCTACATA 
2101 CCCAGCATTA CTGATCTCAG ACAATGAATT GCTAAAAATG ACGATAGGGC 
2151 ATTACACTCA GAATAGTTTG CTATATTTCC ACATACCTCA TCTAGATGTC 
2201 ATAGCCTACA TTTCTGCCAT CACTTAACTG ACATTTTTTG TGTGTTCTTG 
2251 ATGATAAATA GACAGTTCTT ATTATTGTCC TCAAATAATA AAAGAAACTG 
2301 AAATTTTCTT ACATAGAGAA AATGTCCATA AGATATTCAA GTTAAACAGA 
2351 TTATTTTGAG ATAAGTAACC ATTAGAAATA TGTGATTGTA ATTTCTGATT 
2401 TTATAAAATT TTAATTGATA GTACACTTGA TTTAAATGTC TATTCTTTAA 
2451 AATGATGAAT ACTCATAATT CTTATCTCTA TAATCAAAAG TATAATTTAC 
2501 TGTAGAAAAA TAAAGAGATG CTTGTTCTGA AAGTAAAAAA AAAAAAAAAA 
2551 AAAACACTGT CATGCCGTTA CGTAGCGTAT CGTTGACAGC CCACTGTCAT 
2601 GCCGTTACGT AGCATATCGT TGACAGCGAC ACTGTCATGC CGTTACGTAG 
2651 CGTATCGTTG ACAGCACTGT CATGCGTTAC GAGCGTATCG TTGACAGCAC 
2701 TGTCATGCCG TTACGTAGCG TATCGTTGAC AGCAAAACAC TGTCAGCCGT 
2751 TACGTAGCG (SEQ ID NO:1) 



FIGURE 1A 



Docket No.: CL000763 
Serial No.: 09/784,340 
Inventors: Ming-Hui WEI, et al. 
Title: ISOLATED NUCLEIC ACID MOLECULES- 
REPLACEMENT SHEET 



FEATURES: 

5'UTR: 1-37 

Start Codon: 38 

Stop Codon: 1619 

3'UTR: 1622 



Homologous proteins: 

Top 10 BLAST Hits 

CRA|147000022596013 /altid=gi|10438148 /def=dbj|BAB15179.1| (AK.
CRA|1000682322899 /altid=gi|5802604 /def=gb|AAD51732.1| (AF1752.
CRA|335001098690982 /altid=gi|H436851 /def=ref|XP_003547.1| UD.
CRA|18000005226060 /altid=gi|5803213 /def=ref|NP_006789.1| UDP.
CRA|18000005155438 /altid=gi|4507821 /def=ref|NP_001068.1| UDP.
CRA|18000005147363 /altid=gi|6175083 /def=sp|P06133|UDB4_HUMAN.
CRA|18000004953169 /altid=gi|484383 /def=pir||JN0619 glucuronos.
CRA|18000005148770 /altid=gi|3153832 /def=gb|AAC95002.1| (AF064.
CRA|18000004903910 /altid=gi|4507819 /def=ref|NP_001067.1| UDP.
CRA|18000005164461 /altid=gi|3426332 /def=gb|AAC32272.1| (AF081.
CRA|1000682327923 /altid=gi|5881246 /def=gb|AAD55093.1|AF180322.
CRA|18000005219476 /altid=gi|8134780 /def=sp|Q9XT55|UDBJ_MACFA.



BLAST dbEST hits: 

gi|10971169 /dataset=dbest /taxon=96..  e-103
gi|11068678 /dataset=dbest /taxon=96..  6e-59
gi|679005 /dataset=dbest /taxon=9606 /  2e-52
gi|3173232 /dataset=dbest /taxon=9606  2e-52
gi|3134358 /dataset=dbest /taxon=9606  2e-52
gi|10298020 /dataset=dbest /taxon=96..  8e-49
gi|11974507 /dataset=dbest /taxon=96..  1e-47
gi|11973717 /dataset=dbest /taxon=96..  2e-40
gi|12673874 /dataset=dbest /taxon=96..  1e-29
gi|10887798 /dataset=dbest /taxon=96..  4e-26



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 

gi|10971169 Kidney hypernephroma
gi|11068678 HepG2 cell line
gi|679005 Liver
gi|3173232 Kidney
gi|3134358 Kidney
gi|10298020 Hepatocellular carcinoma
gi|11974507 Normal pigmental retinal epithelium
gi|11973717 Normal pigmental retinal epithelium
gi|12673874 Kidney hypernephroma
gi|10887798 Kidney



Expression information from PCR-based tissue screening panels: 
Human fetal liver 
1 MRSDKSALVF LLLQLFCVGC GFCGKVLVWP CDMSHWLNVK VILEELIVRG 

51 HEVTVLTHSK PSLIDYRKPS ALKFEVVHMP QDRTEENEIF VDLALNVLPG 

101 LSTWQSVIKL NDFFVEIRGT LKMMCESFIY NQTLMKKLQE TNYDVMLIDP 

151 VIPCGDLMAE LLAVPFVLTL RISVGGNMER SCGKLPAPLS YVPVPMTGLT 

201 DRMTFLERVK NSMLSVLFHF WIQDYDYHFW EEFYSKALGR PTTLCETVGK 

251 AEIWLIRTYW DFEFPQPYQP NFEFVGGLHC KPAKALPKEM ENFVQSSGED 

301 GIVVFSLGSL FQNVTEEKAN IIASALAQIP QKVLWRYKGK KPSTLGANTR 

351 LYDWIPQNDL LGHPKTKAFI THGGMNGIYE AIYHGVPMVG VPIFGDQLDN 

401 IAHMKAKGAA VEINFKTMTS EDLLRALRTV ITDSSYKENA MRLSRIHHDQ 

451 PVKPLDRAVF WIEFVMRHKG AKHLRSAAHD LTWFQHYSID VIGFLLTCVA 

501 TAIFLFTKCF LFSCQKFNKT RKIEKRE (SEQ ID NO: 2) 

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

Number of matches: 3 

1 131-134 NQTL 

2 313-316 NVTE 

3 518-521 NKTR 

[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

Number of matches : 2 

1 67-70 RKPS 

2 340-343 KKPS 

[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 6 

1 3-5 SDK 

2 120-122 TLK 

3 169-171 TLR 

4 200-202 TDR 

5 435-437 SYK 

6 520-522 TRK 

[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 



Number of matches: 9 

1 62-65 SLID 

2 141-144 TNYD 

3 204-207 TFLE 

4 243-246 TLCE 

5 258-261 TYWD 

6 296-299 SSGE 

7 297-300 SGED 

8 419-422 TSED 

9 435-438 SYKE 



[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 
Number of matches : 2 

1 122-130 KMMCESFIY 

2 136-143 KKLQETNY 

[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches : 4 

1 19-24 GCGFCG 

2 276-281 GGLHCK 

3 373-378 GGMNGI 

4 377-382 GIYEAI 

[7] PDOC00009 PS00009 AMIDATION 
Amidation site 

338-341 KGKK 

[8] PDOC00359 PS00375 UDPGT 
UDP-glycosyl transferases signature 

354-397 WIPQNDLLGHPKTKAFITHGGMNGIYEAIYHGVPMVGVPIFGDQ 

[9] PDOC00804 PS01047 HMA 
Heavy-metal-associated domain 

12-41 LLQLFCVGCGFCGKVLVWPCDMSHWLNVKV 



Membrane spanning structure and domains: 

Helix  Begin  End  Score  Certainty 

1      5      25   1.802  Certain 

2      157    177  0.765  Putative 

3      181    201  0.779  Putative 

4      377    397  0.735  Putative 

5      491    511  1.931  Certain 
BLAST Alignment to Top Hit: 

>CRA|147000022596013 /altid=gi|10438148 /def=dbj|BAB15179.1| 
(AK025587) unnamed protein product [Homo sapiens] 
/org=Homo sapiens /taxon=9606 /dataset=nraa /length=449 
Length = 449 

Score = 931 bits (2381), Expect = 0.0 

Identities = 448/449 (99%), Positives = 448/449 (99%) 



Query: 79  MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 138
           MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL
Sbjct: 1   MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 60

Query: 139 QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 198
           QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG
Sbjct: 61  QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 120

Query: 199 LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 258
           LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT
Sbjct: 121 LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 180

Query: 259 YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK 318
           YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK
Sbjct: 181 YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK 240

Query: 319 ANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGI 378
           ANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGI
Sbjct: 241 ANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGI 300

Query: 379 YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 438
           YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE
Sbjct: 301 YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 360

Query: 439 NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLTC 498
           NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLL C
Sbjct: 361 NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLAC 420

Query: 499 VATAIFLFTKCFLFSCQKFNKTRKIEKRE 527 (residues 79-527 of SEQ ID NO:2)
           VATAIFLFTKCFLFSCQKFNKTRKIEKRE
Sbjct: 421 VATAIFLFTKCFLFSCQKFNKTRKIEKRE 449 (SEQ ID NO: 4)





>CRA|1000682322899 /altid=gi|5802604 /def=gb|AAD51732.1| (AF175221) 
UDP glucuronosyltransferase UGT2A3 [Cavia porcellus] 
/org=Cavia porcellus /taxon=10141 /dataset=nraa 
/length=530 
Length = 530 

Score = 795 bits (2030), Expect = 0.0 

Identities = 377/530 (71%), Positives = 435/530 (81%), Gaps = 3/530 (0%) 

Query: 1   MRSDKSALVFLLLQLFCVGCGFCGKVLVWPCDMSHWLNVKVILEELIVRGHEVTVLTHSK 60
           M   K A   LLL L C G GFCGKVLVWPC+MSHWLN+K +LEEL+ RGHEVTVLT S
Sbjct: 1   MAPGKLASAVLLLLLCCAGSGFCGKVLVWPCEMSHWLNLKTLLEELVKRGHEVTVLTLSN 60

Query: 61  PSLIDYRKPSALKFEVVHMPQDRTEENEI---FVDLALNVLPGLSTWQSVIKLNDFFVEI 117
           IDY + A FEV+ +P D+ I F++LA+NV+P + WQS L FFV+I
Sbjct: 61  NLFIDYNRHPAFOTEVIPVPTDKNMSENILNEFIELAVT^ 120

Query: 118 RGTLKMMCESFIYNQTLMKKLQETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGN 177
              L + C + +YNQ+LMKKL+++ YDV++ DPVIPCG+L+AE+L VPFV  L+ S+G
Sbjct: 121 TEDLGLNCRNTVYNQSLMKKLRDSKYDVLVTDPVIPCGELVAEMLGVPFVNMLKFSMGHT 180

Query: 178 MERSCGKLPAPLSYVPVPMTGLTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKA 237
           +E+ CG+LPAP SYVPVP+ GLT RMTF+ERVKN + SVLF FWIQ YDY FW++FYS+A
Sbjct: 181 IEKYCGQLPAPPSYVPVPLGGLTTRMTFMERVKNMVFSVLFDFWIQQYDYKFWDQFYSEA 240

Query: 238 LGRPTTLCETVGKAEIWLIRTYWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSS 297
           LGRPTTLCE +GKAEIWLIRTYWDFEFP+PY PNFEFVGGLHCKPAK LPKEME FVQSS
Sbjct: 241 LGRPTTLCEIMGKAEIWLIRTYWDFEFPRPYLPNFEFVGGLHCKPAKPLPKEMEEFVQSS 300

Query: 298 GEDGIVVFSLGSLFQNVTEEKANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQ 357
           GEDG+VVFSLGS+ +N+TEEKAN+IASALAQIPQKVLWRYKGKKP+TLG NTRL+DWIPQ
Sbjct: 301 GEDGVVVFSLGSMVKNLTEEKANLIASALAQIPQKVLWRYKGKKPATLGPNTRLFDWIPQ 360

Query: 358 NDLLGHPKTKAFITHGGMNGIYEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKT 417
           NDLLGHPKTKAFITHGG NGIYEAIYHGVPMVG+PIF DQ DN+A MKAKGAAVE+N  T
Sbjct: 361 NDLLGHPKTKAFITHGGSNGIYEAIYHGVPMVGMPIFSDQPDNLAGMKAKGAAVEVNMNT 420

Query: 418 MTSEDLLRALRTVITDSSYKENAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSA 477
           MTS DLL ALRTVI D +YKENAM+LSRIHHDQPVKPLDRA FW+EFVM HKGAKHLR A
Sbjct: 421 MTSADLLGALRTVINDPTYKENAMKLSRIHHDQPVKPLDRAAFWVEFVMHHKGAKHLRVA 480

Query: 478 AHDLTWFQHYSIDVIGFLLTCVATAIFLFTKCFLFSCQKFNKTRKIEKRE 527 (residues 1-527 of SEQ ID NO: 2)
           AHDL+WFQ++S+DVIGFLL CVA+AI L TKC LFS Q F K  K  K+E
Sbjct: 481 AHDLSWFQYHSLDVIGFLLACVASAILLVTKCCLFSFQNFIKIGKRIKKE 530 (SEQ ID NO: 5)



Hmmer search results (Pfam): 

Model Description Score E-value N 

PF00201 UDP-glucoronosyl and UDP-glucosyl transferas 962.0 1.5e-285 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00201 1/1 24 525 .. 1 507 [] 962.0 1.5e-285 
1 TTCTAGAGGG TTGGAACAAC TTTTCCCTGA TACATTGCAT TTTTTTGATA 
51 CCTTCAGTAC ATGTTAAACT GGCAACCACC AGTGAACTTT ACTCTTAAAA 
101 TATTAATTTT TAACTTCTGT GCTTATATTG TCATTTCAAC TCCTTGCTTA 
151 GTAACTACAA AACCATTGCA GATCAGTGTG TGAGGGAACT GCCATCATGA 
201 GGTCTGACAA GTCAGCTTTG GTATTTCTGC TCCTGCAGCT CTTCTGTGTT 
251 GGCTGTGGAT TCTGTGGGAA AGTCCTGGTG TGGCCCTGTG ACATGAGCCA 
301 TTGGCTTAAT GTCAAGGTCA TTCTAGAAGA GCTCATAGTG AGAGGCCATG 
351 AGGTAACAGT ATTGACTCAC TCAAAGCCTT CGTTAATTGA CTACAGGAAG 
401 CCTTCTGCAT TGAAATTTGA GGTGGTCCAT ATGCCACAGG ACAGAACAGA 
451 AGAAAATGAA ATATTTGTTG ACCTAGCTCT GAATGTCTTG CCAGGCTTAT 
501 CAACCTGGCA ATCAGTTATA AAATTAAATG ATTTTTTTGT TGAAATAAGA 
551 GGAACTTTAA AAATGATGTG TGAGAGCTTT ATCTACAATC AGACACTTAT 
601 GAAGAAGCTA CAGGAAACCA ACTACGATGT AACGCTTATA GACCCTGTGA 
651 TTCCCCGTGG AGACCTGATG GCTGAGTTGC TTCCAGTCCC TTTTGTGCTC 
701 ACACTTAGAA CTTCTCTAAG AGGCAATATG GAGCGAAGCT GTGGGAAACT 
751 TCCAGCTCCA CTTTCCTATG TACCTGTGCC TATGACAGGA CTAACAGACA 
801 GAATGACCTT TCTGGAAAGA GTAAAAAATT CAATGCTTTC AGTTTTGTTC 
851 CACTTCTGGA TTCAGGATTA CGACTATCAT TTTTGGGAAG AGTTTTATAG 
901 TAAGGCATTA GGTAAGACAC TTTTGTTTTA TTTTTAATTT AGTTATCAAA 
951 AGAAATATTT TTAAAAATTG TCATACATTG TCTATGACAT ATATATGCAG 
1001 GTCAATGAGT TTTTTTAGAA AATGTTGTAG CTGTTTTTCA TAAAGAAAGT 
1051 GTATTTGTTC TAAGCGTAAG ATAACCTACT TTCTTAATAC CAGTAATATA 
1101 CTTAAAAATG ATCATCAATA ACTAAGAGAT TATATTTTGT ATTTCCTCCA 
1151 AATAGCGCAA ATCAACATCA CATATTTTTG AGAATCACTG ATTGTTAGTC 
1201 TGAATGTTAT AGAATTTCTA TTGAAATAAA ATGCTAATCA TTATTTTCTC 
1251 TCTCATCATG TATTTAAGAA AATCTTCAGA AGGTCTTCTT TGAATTAATT 
1301 TTTCAAGAGT CATTAAATTG AACATTTTCT AGAATTCTTT AATTTCTTAG 
1351 GTGATTACTT CACAAAAACT TGAAAAAATA TTATAAAAAG TTAAAAAACT 
1401 TACGGTCTTG TGGGGCATAA GATAGTAGAA TTTTTACTTT ACTGATATAC 
1451 ACCTATTTGA CTTATTTTTA TTTCTTTGCT TTACTGATAA AAAGTTGTTT 
1501 TGCTTTGCAA TTTTCATATA GTTGTGATCA GAGCTGGTCA ATGCAAGACA 
1551 TGTTTTTATC CAAATATGTT TGAGAATTAT GTAGAAACAT GAAAAAAGGT 
1601 ACAATTATAT CCGACACTAA AATATTGTTT AATGTATTCC AACGAATTCT 
1651 TATGCATAGA CTGTTTCACA GAACTAATAT TCAGAGGATC CCAGTTCAAA 
1701 TGTCCTTAGC CTTAGACATG ATTTGAATTT ACATATATTG ATTTGCTTTA 
1751 AATAATTTTC CATTCAGTAA GCTGTGCCTA GCTGCAGATA GCCTACCAGG 
1801 CTTTATGGAT CTAGGTAAAC AATACAAATC TCTTGGCCTC AAGTCTACAT 
1851 TCAGATATTA ATTTAAAGGG GTACAGCTAT ATAGAGGTCA CTGGCAAATT 
1901 TTGGTAAAAT AGGATTATAG TAAAAGCCCC CTGACAAGAT TGAAATTTAA 
1951 AATAAAACAA AAGTGTTATC AAAGGGGTGA AAGAGCATTT TCCAATAAAC 
2001 AAAAGTGGGT TCTGGCCATG CATTCAGAAA TTCCCCAACA ATTCTTTAAA 
2051 AATCATGGAG CAGCTTGATA TATAAGAAAT TCATTTAATA ACTATATTTA 
2101 TTATGTAGCT CCAACTTACT AAATTATTGA TTATTATATA TTTTATAGAA 
2151 TTATCTATTG TGAGTCTAAA TCAAGAGTAT ATATTCAAAC AACTATAGGA 
2201 AAAGGGATAT CAGTCAATTT CAATTCAAGG ATTTATTTCC ATAAGTGCTT 
2251 ACGCACAGGT GTATTTCATT TTATTATACA TTGCTTTATT GTCCTTCACA 
2301 AAAATTGCAA TTTACAAATT AAAGGTTTTT GAAAACCTTG AATCAAGCTA 
2351 ATCAATTTGG CGTAATATTT CCAACAACAA GTGTGTACTT TTGACTCTAT 
2401 CACATATTGG CATTTATCAT GCTTTTTCAA ATTTTTCATT GTTATATCTG 
2451 TTACGGTGAT CTGGGATCAG TGTTCCTTGA TGGTTACACG TTTATTAGCT 
2501 TGGGGGCACC TTGATGTGTT ACAATATAAG ACAGCAAACT TAATTATAAA 
2551 TGTTGTGCAT GTACTAACTG CTCCGCTGAT TCGTTTCCCC ATCCCACTTC 
2601 TTCTTAGGCC TCCCTATTCC CTGAGACACA GTAATATAAC ATACAATGAC 
2651 TTCTAAATGT TCCAGTGAAA AGAAAAGTAG CAGGTCTCTC AATTTAAACC 
2701 AAAAATATAA AGGAATAAGT TTAATGAGTA CTATAGTTTA GATATGGTTT 
2751 GCTTGACCCT ACAAAATCCT GTGTTGAAAT TTGATCACCA ATATTGGAGG 
2801 TGGGGCTTGA TGGGAAGTGT TAGGGTCATG AGGGTAGATT CCTTATGAGT 
2851 ACATTAATGC TCTCCCTGGG GAAATGGGTG AGTTCGTTCT CACTCTATTA 
2901 GGTCCCAGGA GAGATAATTA TTAAAAAGAG CCAGGAACAT CCACCTTCTT 
2951 TCTCTTGCAT ATCTCTCATT ATCTGATCCC TGCACTTGCT GGCTCCCAAC 
3001 ATCTTCTTCA ATGAGTGGAG GAAACCAGAG GTCTTCACCA GACACAGATG 
3051 TTGGTGCCAT GCCTCTTGTA TACCCTGAAG AATTGTGAGC CAAATAAAAA 
3101 CCTTTTTCTT TTACAAATTA GACAGCCTCA GTTATTCCTT TGTAGCAACA 
3151 AAAAAAGCCT GGGACAGGCC AAAAACTACA CCATTGCACC AAACAGTTAA 
3201 ACAAGATGTG AGTGCAAAGG AAAAGTTTTT GGAGGAAATT AAAAGTGCTA 
3251 CTCCAGTGTA CATACAAATG ATAAGAACAA ATAACCATTA TCAGTGCTGA 
3301 TATGGAGAAA ATTTTAGTTG TCTGGAGAGA AAATCAAATT AGCTAGCCAG 
3351 CTGCAGTGAT TCATATCTGT AATCCCAGTA ACTTGGGAGG CTCAGGTGGG 
3401 AGAACGGCTT GAGCCCAGAA GTTTGAAGTC CAAGGCTGCA GTGAGCTATG 
3451 ATTGCTCCAC TGCACTCCAA CCTAGGTGAT AGAGCAAAAC CACTACCAAA 
3501 AAAAAAAAAA AAAAAAAGAA GAAAAAGAAA AGAAAAAAAA TTAAACCAAC 
3551 CACAACATCA CCTTAGGTTT TGGCATTAGC TAAAAACTAA TACATAGTAA 
3601 AGCGTTAACT ATTCAATTGC ATGAAACCTC AGAGAGGAGA GGAAGATGCA 
3651 GAAAAAAAGA CTGAAGCTAG TAGAGGTTGA CTAATGAGGT TTACAGGAAT 
3701 AAACTGCCTA CATGATGCAA AAGTTCAATG TGAAGCAATA GGAAGTCATG 
3751 CAGAAGACTT AGCTAATATA CTCAGTAAAT GTGGCTACAG TAAACAAATG 
3801 ATTTTCAATG TAGACCTAAC AGCCTTCTGT TGGAAGAAGA TGCCATTTAA 
3851 AACTTTCATA GCTAGAGAAG AGAAGTCAAT GCTTGTCTCT GAAGCTACAA 
3901 AAAACAGGCT GAATCTCTTG TAGTGGCTAA TGCAGCTGAT GACAAAGGTA 
3951 AAGCCAATGC CCATTTACTT TTTGTAATAA TTATAGAGGA CTCTTAATAA 
4001 TTATGTTAAA TCTACTTTGC CTGTGTTATA TCAATGGAAC AACAAAGCCT 
4051 GGATGATATC ACATTGGTAT ATGACATGGC TTATTGAATA TTTTAAGCAC 
4101 ACTGTTGAGA CCTATTGCTC AAAAAAGAGG ATTCCTTTCA AAATATTGCT 
4151 GCTCATTGAC AATTCACATG GTCAACAAAG GGCTCTGATT AAGATGTACA 
4201 GATATTAATG TTTGCCTGCT TGCTATTATT ACATCGATCT TACATGCCAT 
4251 GGATCATATA GCCTTGACTT TCAAGTCTTA TGTAAGAAAT ATATTTTGTA 
4301 AGGCTATAGC TCTTACTAAT GGGGAAAGTA TATTGAAAAC CTTTTCAAAA 
4351 GGATTTTTCA TTCTAGATTC CATTAAGAAC ATTCATGGTT CATGAGAGGA 
4401 AGTCAAGATA TTAACATTAA CAAGAGTTTG GAAAAAATTT GATTCTAACT 
4451 CTCCTGGATG ATTTTGAGGG ATTGAAGACA TCATGTGAAG AATTAACTGG 
4501 GGATGGGGTG GTCATGAAAA AATAAATAGA ATTATAAGTG GGCCTGAAGG 
4551 TTTGTCTAAA TTGCTATAAT ATCATGATAA AACTAAAACC TGTAAAACCG 
4601 GTGAGGAGGT GCTTTTTAAA CAGTTACTTT TTATAGATGA ACACAGAAAT 
4651 TGGTTTTGTG AGTTGGAATC TTCTCCGAGT GAAAATGCTA TGAACATTGT 
4701 TGAAATGGCT ACAAATGACT TAGAATATTA CACAAAATTA GTAGATAAGG 
4751 CAGCATCAAG GTTTGAGAGA ATGGACTCAA ATTTTGAAAG AAATTCTACT 
4801 ATGGGTAAAC TGCTGTGAAA CATCATCATA TGCTACAGAG AAATCTTTCA 
4851 TGAAAAGATG AGTCAATTCA TGCAACAATC TTTGTTGTCT AATTTTAAAA 
4901 ATTGTCCAGC TGCCCTGATC AATCAACAGT AATCAGCACT GAGGCAAGAC 
4951 CCTACACCAG AAAAAATAAA AATAAAAAAC CTCACTTGCT GAAGACTCAG 
5001 CTTATTATTA GCACTTTTTA GCCATACTTT TAACTAAGGT ATGTGCATTC 
5051 CTTTTTAAAC GTGATGATAT TGCACAGCTA ATAGCCTACA AGGTATGGTT 
5101 AACATAACTT TTATATGTCC TGGGACCCAA ATTTGTGTGA ATCACTTTAT 
5151 TGACATATTC CTTTTATTGA GATGAACTGC AACTTATCTT GCAATATCTC 
5201 CAAGATATGT GTGTATGGCA TTTCAAATAA GATGTGAAAT TATTTTATTA 
5251 GTATAAAAAG CAAATTTAAT TTTCTTTCCT TTGATCATCT TTATCCTTGT 
5301 TACTGTGTAT TTATCCTTTA AACATTGAAT GACTCCAATT GTTTAAAACT 
5351 GAGTCTTTCT TAAATGAGTC CTAATATCAT AGTAATTAAA ATCACCTACA 
5401 AGTTGGTAAT GCAGGCAGCA TGTGAGGCAC AGAAAACAAC AAATTTATAA 
5451 GACATAAATG CATTTGCTTG GAAGCTGAGA GAAGGCTCTA TTCTAATTTC 
5501 TGATAACTTC AAACTGAGTA TCTTCAGTAA AATTTATTCA CTATCAAATT 
5551 CAAGGCGTTT GGATTTATGA CCTAGGAAAA AACTTCAAAC ATTAAAATGT 
5601 GATGACCTTA AAAAGAGGCT CTCCACACTA TGGTGTATAA CACCACCAAC 
5651 TTTGATTAGA ATTTTAAAGA GAAACAAATT CTCTTATGGA GTTTATCTTT 
5701 TTATCACTTG CAAAATATGT TTTTGTAAAG AGATACTAAT TACTTAGTTA 
5751 TTTGTAGTTA GCCATTCTTC TGATTAAAAA CCTAAAATTA AATCTTGAAA 
5801 ATGTGTTTTC CTTCAAAACA CATCATTTGA GAGAAACACT AAAGTAAGTG 

5851 TATGATTATC ATAGCATGTA CATAGGTGCT TCACAACCCA AAAAGAATAT 

5901 TGTCATGGGT AAGAATCAGT AAAGGAATTT CTCCTAATAA AACAGTAGCC 

5951 TATTAATTAA AGTAATGATA TGCAATACAG CAAGTTAAAG GGAACTGATC 

6001 CTGGTGGGAT TATTGAAAGA TATACCCTTG ACTATAGATT AGAAAATACA 

6051 GAGATGTTAT TTAGTGAAGA TATTGTGGTA CTCATTTATC ATCTGCAATT 

6101 CACTTGCAGA GGAAAAAATG AGTAATAAAT TCATTTGCAT TTTGGATTTG 

6151 TGTCTTTAAG TTGTGAAAAT ACACTTAAAT ATAACCATCT GTCCTTTGCT 

6201 CCTTCCTTCC TTCCTTCTTT CCTTCCTTCC TTCCTTCTTT CCATCCTTCC 

6251 CTCCCTCCAT CCTTCCTTCC TTCCTTTCTT CCTTCCCTCC TTTCTTTCTC 

6301 TGTCCTTCCT TCTTTTTTCC TTTCTTTCTT TTTTCTTCTT TATTATTTCA 

6351 TTAATTCCCC CTTCCATTTG ACGTCTAAAA GCCATGTTGT TCTAGAGGAC 

6401 TTAAACTTAT TTTTTTCTTA ATAGCTTACT GAAAAATTAG TGATACAATT 

6451 TTTTATTTGA ATTGTATGCT AATTCATTCT GTTATTTCTT TTATTGAGGA 

6501 AGGCCCACTA CATTATGTGA GACTGTGGGA AAAGCTGAGA TATGGCTAAT 

6551 ACGAACATAT TGGGATTTTG AATTTCCTCA ACCATACCAA CCTAACTTTG 

6601 AGTTTGTTGG AGGATTGCAC TGTAAACCTG CCAAAGCTTT GCCTAAGGTA 

6651 GGACTATTGT ATTAAGGAAT ATTATGTACT TTATGACATG ACTTGTTTTC 

6701 CCTTGAAAGA TTACAACCTT AGTTATAGAA GGATGATGTT GAATGTCGTC 

6751 TGTTTGCAGC TCCATATTTA TTTTCCATGC CACAGGGGCT CTTATAGGTG 

6801 ATTATATGTC TTTTCGGTAT TATATTGAGA AAGTAGGCAG AAGAATTTCA 

6851 TGATTAGAAT AGATTTTAAA ATACTAGTAT TACAATAGTT TGGATAATAA 

6901 ATTGAATTAA TAGGGAATTG GAGCCATGAA GATCACTAAA AAGAATGCTC 

6951 TAGCCTTTCT CACAATCAAA TTGGGCTTAT GAACAAGGAT ATTTGTCATG 

7001 ATAGTACAGA AATAAGCATA TTTTCATGAG ACATATTGGA TATATTCCAC 

7051 AGGAGTTGGT GAGTGAGAGA AAATAAGTGA TGAAGGAAGA CAAAGAATAA 

7101 AAGAAAATTT CAATAAATGG AAAGTTTAAG TGTTTAATGA TAGTGATGAC 

7151 TTTTACTCAA ATAAGTGCTT AGAAGTCATC TTGTTTGTGA TTTATATGAT 

7201 GAATTCTGTG TTGTGACTAT CCACTTTGAG CTCGTGAGAA TGTTAGGTGA 

7251 GGTTTAATAA AAGCCATTTG AGAAAAACAA GGTTTCAACC TCTGTGGACA 

7301 GAAATCTAAA TATCGATAGT TATCAGGACA AAGTAGAGCT CATAGAAATA 

7351 ATTTTGCAGC CTGCAGGTTT GTTTTGGAGT GAAAATAAAA TTGTATACTA 

7401 TATTCCTAAA TCATCAGAGG AAAAAATTTA TAGTTCAAGG AATGTTGAAA 

7451 GAAACAATAT TGAGAAGTAA AAGTGAGTAA TAGTTGTTAT AGTTTTTTAA 

7501 TAGTTTTGTA AGTATGTCTT GAGTTCACTG TCCCAAAAGT GGCTATTAGC 

7551 TCTAGCCTTG ACCTGACAAG GTTCTAGGAT ATTTAGTCAT GGATGTTCAT 

7601 AATCTACCTC TTACGGGATA CTTTTTATTC TGATGAACAG CCTAATGCCT 

7651 AAGTGTGCAA TCTATACCAA GATTGTTCTT ATAGGGAACT TGTTTACACT 

7701 GGAAGACACC ACTGTGTCTC TTGTATGACC TATGTCTTCT TTATCCCTAC 

7751 AAAGGTAACC ACATTATAGG AAACCCTGAC AAGGCCAGAT GTTATATTTG 

7801 TGTTGGTCAA GTGAGAAAAC ATGGGAGAAA CTTAACCAAA CACATAAAAT 

7851 AACAGAAACA GTCTTCTTTG ACCATTTCTA GAGAAAAGAG TTCAGCATCC 

7901 CTTGTAAGGC CACTAGGAAG AAGAAAATTC TCTGGGAAAA GCACATTCAA 

7951 CCAATGAATG GAGACCAAGA AAGAGAGTGA GGGATCTATG TGCCAAAATG 

8001 TTAACTGGGA TCCAGGGTGT TACCTAGGTG GGTTTCCAAT GGGGAACTGT 

8051 AATTGGTAGG TTTAATGCAA GCAGGCACAA AGTCCATGGA GGCATTCTGA 

8101 GACTGAAAGA TAGTCACTTT GGCATATCTG CACAGAATCT GATCAGTGAT 

8151 TCAAGCCCAA GTAGGCTGTA TCTAGTTGTC CTATAGGGTG GTTACCAGGA 

8201 GGCAGTGTGT AAGTAAAAAT CCTGACTGAA CACATTGAGG AAATGGAAGG 

8251 AGGTGGAAGA TTTTAAACGG TGTCAGTGTT GACTAAGACC TGCTTCTGGT 

8301 ATGGAAAATT CAACTTATAT TTTAAATGCA TAGCCAGACA ACATAAAATT 

8351 ATAAGAATTT ACCACAATAG CTATGGTAAC AATACTGGGT TTACCTATTA 

8401 CTACAGAGTG AAAAGAAAAC CCTCATTTCC CATTTTATGG AAATATAATC 

8451 AAAATCCTAT AAGGAAGGTT TCAGAGCCAG TAGGATTTCC AGAAAAATTA 

8501 TTGGTTTTAT AGTAAGATGT GTATTGATGA ATATAATTTT ATTTATTAAT 

8551 TATTAATATC ACTTTACTTA CCAGGAAAGT TATACCAGAA AACCAAGCTC 

8601 TCTTAAGCCA TGGCATCTGT ATCTAAAATA GAAATACAGA AGGAGAGCTG 

8651 ACAATTTCCA TCATTCTCTA GGTAATCTCC CATGCCATTC TACCCTTTAT 
8701 TCCCACACTC CCAGTTTTAC ACACACACAC AAACACACAC ACACAAACAC 
8751 ACACTCATAG AAATAATCAT AGAAGACATA TTTTTAAAAA AGTTAGATCC 
8801 ATACAGTAAT AATTTATTAG GTAAAAGCTT TTGTGCTGAT AATTTTACAA 
8851 GTTTAATTGA GATATATTTT AGGGCTGTCT TACACTAAAT ATTTATTTTT 
8901 ATTTTTTAAA TTTGACATGT AATAATTGCA CATGTTTAAG AGAAATGCTG 
8951 TGGTATTACA ATACATTTAA ATGTTGTGTA ATAATTACAT CAAGATAATA 
9001 AACCCATCAT CTAAATATTT ATCATTTCTT TGTGGTGATA ACATTCAAAA 
9051 ACCTCCTTTC TGGCTATCTT GAAATATGTA ATACATTACT ATTAACTATA 
9101 GTTACCCAAC AACTTAATAT AATAACAGAA CATATTCTTC CAAATTTAAA 
9151 CGTTGTATCC ATTGATCCAC CATTTCTCAT TGCCCTCCCT ACTATCTCTT 
9201 CAGCCTCTAG TAACCACAAT TCTACTCTCT AATTATATTA TGAATGCATT 
9251 TTTTGATTCC ACATATAAGG GATACCATGC TATCTCTGCC TGGATTATTT 
9301 CAGTTAACAT TATGCCCTGG AGGTTCATTC ATGTTTCTAC AAATGACAGG 
9351 ATTTCATTCT TTTTTTTCCA ATATATATTT AATGAAATGG ATATATATAA 
9401 ACATTGGAAA ATGTATATAT ATATATATAT CTCCAGTGGA ATGCTATTGA 
9451 GCTATAAAAA AGTTAATATA TAATAGAAAT AAAGCTTATA TATATCTAAT 
9501 GGAATGGATA TATATATATA ATGGAATAGA AATATATATC TATACATATA 
9551 AACACACGCA ATATACATAT CCATTTCATT GCATATATAT ATATATAGAG 
9601 AGAGAGAGAG AGAGATATTT TCAAATGTGT GTATATATAT CCAATGGAAT 
9651 GGACATATAT ATATGTATAT TTTTTCCATA TTTTCTTTAT GTATTTCTTC 
9701 ATTAATGGAT GTTTAGGTTG ATTCATCCCT TGGGTATATG AATAATGTTG 
9751 ATGTAAACAT AGAAGGACAG ATATCTCTAT GACTTCTTAG TTTATTTAAA 
9801 TATACACCCA GTAATGGAAA TGCTGTATAA TATGGTAGTT CTATTTTCAT 
9851 TTTTTGAGGA ACTACCATAC CGTTTTCCTT ACTAATTGTA CTAATTTGCA 
9901 TTTCCCTCAA CAGTTTATAA AAGATCTTCT TTCTCTGCAT ACTTTCTAGC 
9951 ACTTGTTATT TTTGCCTTTT GATAATAGCC ATAACAGGGG TGATGTGATA 
10001 TCTCATTGTA GTTTTGATTT GCATTTCCCT GATGATTAGT GATTTTGAGC 
10051 ATTTTGTAAT TATACTTCTT AGTCACTGAT AGTCTTCTTT TGAGAAGTGT 
10101 CTATTCAGGT CTTTTGCTTA TTTTTTAATC AAATTAGTAA TTTATTTTTA 
10151 TTGACTGATG TGACTTCTAT GTATATTTGA GATAGTAACT TATTGTCAGA 
10201 TTCATAGTTT GCAAATATTT TTCATGTTGT GAATTGTCTC TTCACCCTGT 
10251 TGTTTGCTTC ATTTTCTCTG CACAAGCTCA ATGCTTTGAT ATAACCCATT 
10301 TATCTACTTT TCCTTTTGTT GGCTGTGCTT CTGAAGTCCT ATCCAAAAAA 
10351 ATCCTTGCCT AGACCAATGT CACAAATCAT TCCTCCTACA GTTTCTTCTA 
10401 GTAGTTGTAT AATGTTTGGC CTTATATTTA ACTTTGTAAT TCATTTTTAC 
10451 TTACTTTGTA TATGGTGAGG GATAGAGGTC TAGTTTCATT TTCTGCATGT 
10501 GGATATGCAG TTTTCCTAGC ACCATTTAGT GAAGAGGTTG CCTTTTTTCT 
10551 ATTATGTGTT CTTGGCACCT TTGTCAAAAG TCAGTTAGCT GCTATATTCC 
10601 TCCATTTGTG TTGTTATAGA GGAACACATG AGACTAGCAA ATTTATATAT 
10651 CAAATAGAAT TATTTGAATG ATAGTTCTGC ATACTGTACA AGAAGCACAG 
10701 CACTGACTTC TGCTTGGCCT CTGGTAAGGT TCTCAAGATG CTTCCACTTG 
10751 TGGTAGAAGG CAAACATGAG CTGGTATATG CAAAGGTCTC ATGACAAGAG 
10801 AGGAAACCAT AAAGAGGGGA TGTGAGGGAG TGCCAGGTTT TGTAAAACAA 
10851 CTAGCTCTTC TGGGAACTAA TAGAGTAAAA ATTCGCCTCC CAGGCAGGGG 
10901 ATTAATCTAT TCATGAGGGA TCTGCTTCCA TGACAAAGGC ACATTCTGTT 
10951 AGATTCTACC CCCAATATTG GGGATCAAAT TTTAACATGA AGTGTGGAGG 
11001 GCTCAAATAT CCATACTATG GCAGCAGTAA ATGCATAAAT TTATTTTGTG 
11051 GATCTCTATT CTATATAGTA TTGGTGTATG TATCTGTTTT CATGCCACTG 
11101 CCATACTGTT TTGGTGATGA TATCTATGCT ATATATGTGT GTGTGTATAT 
11151 ATATATTATA TATATGTATA TATGTGTATA TTATATATAT GTATATATGT 
11201 GTATATTATA TATATATAAT ACTTTAAGTT TTATATATAT ATAAAATACT 
11251 TTAAGTTCAA GGGTACATGT GCAGGATGTG CAGGTCAGTT ACATAGGTAT 
11301 ACATGTGCCA TTTTGGTTTG CTGCATGCAT CAACTCATCA TTACATTAGG 
11351 TATTTCTCCT AATGCTATCC CTCCACCAGC CACCCAACCC CCAACAGGCC 
11401 AGGTGTGTGA TGTTCCCCGC CCTGTGTCCA TGTGTTCTCA TTGTTCACTT 
11451 CCTACCTAAA AGTGAGAACA TGCAGTGTTT GATTTTCTAT CCTTGTGATA 
11501 GTTTGCTGAG AATGACTGTT TTCAGCTTCA TCCATGTCCC TCAAAAGGAC 
11551 ATGAACTCAT CCTTATTTAT GGCTGCATAG TATTCCATGG TGTATATGTG 
11601 CTACGTTTTC TTAATCCAGT CTATCACTGT TGGACATTTG GGTTGGTTCC 
11651 AAGTCTTTGC TATTGTGAAT AGTGCTACAA TAACCATATG TGTGCATGTG 
11701 TCTTTATAGC AACATGATTT ACTATCCTTT GTGTACATAC CCAGTAATGG 
11751 GATAACTGGG TCAAATGGTA TTTCTAGTTC TAGATCCTTG AGGAATCCCC 
11801 ACACTGTCTT CCACAATGGT TGAACTAATT TACATTCCCA CCAACAGTGT 
11851 AAAAACGTTC CTATTTCCCC ACATCCTCTC CAGTATCTGT TGTTTCCTGA 
11901 CTTTTTAATG ATGGCCATTC TAACTCACAT GAGATGGTAT CTCATTGTGG 
11951 TTTTTGTTTG CATTTCTCTG ATGACCAGTG ATGATGAGCA TTTTTTCATG 
12001 TGTCTTTTGG CTGCATAAAT GTCTTCTTTT GACAAGTGTC TGTTCATATC 
12051 CTTTGCCCAC TTTTCAATGG AGTTGTTTGT TTTTTTCCTG TAAATTTGTT 
12101 TAAGTTCATT GTAGATTCTG GATATTAGCC CTTTGTCAGA TGGGTAGATT 
12151 GCAAAAATTT TCTCCCATTC TGTAGGTTGC CTGTTCACCC TGATGGTAGT 
12201 TTCTTTTGCT GTGCAGAAGC TCTTTAGCTT AATTAGATCC CATTTGTCAA 
12251 TTTCGGCTTT TGTTGCCATT GCTTTTGGTG TTTTAGTCAT GAAACCCTTG 
12301 CCCAGGCCTA AGTCCTCAGT GGTATAGCCT AGGTTTTCTT CTAGGATTTT 
12351 TATGGTTTCA GGTCTAACAT TTAAGTCTTT AATCCATCTT AAATTAATTT 
12401 TTGTATAAGA TGTAAGAAGG GATCCGTTTC AACTTTCTAC ATATGGCTAG 
12451 CGTGTTTTCC CAACACCATT TATTAAATAG GGAATCCTTT CTCCATTTCT 
12501 TGATTTTGTC ATATTTGTCA AACATCACAT GGTTAGAGAT GTGTAGTGTT 
12551 ATCACTGAGG CCTCTTTTCT GACTCCATTG ATCTATATAT CTGTTTTGAT 
12601 ACCAATACCA TGTTGTTTTC GTTACTGCAA CCTTGTAATG CAATTTGACA 
12651 TTCAGGACCA TGATGCCTCC AGTCTCTTTT TTTTTTTCTA AATAATTTTT 
12701 TTGTCAATGT AAGCTCATTT TCGCTTCTTT CTGATCCATA AAGTATTTTT 
12751 TTCCCATTTT GTGGAGAACG CCGCNNNNNN NNNNNNNNNN NNNNNNNNNN 
12801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNGGCACA CCTCGTGCGC 
12851 ATATATATAT ATATATATAT ATATACCTCT ATATATATAT ACATACATAC 
12901 ATACATACAC ACCTCCTTGT CTGGTGTGGG ATCAGGGTAA TGCTAGCCTC 
12951 ACAAGATGAT ACTGAAGTGT TTTTGCCTTT TTGACTTTTT GATGGTTTGG 
13001 AAGAGTGAGA AAAAGTGTTA TTAATTATTC TTTAAATTTT GTTGAATTTC 
13051 ATAGTGAAGA CCTTAGCTCA CTGGCTTTTT TAATGAGAAC TTTATTACTG 
13101 ATTTAAACTT CTTCTTCATT ATTTATTTCT GCCTTGTTTT TATTTCTTCA 
13151 TAATCCAGTC CTATTTTATG TGTCCACTAA ATTGTTTATT TTCCTAGAAT 
13201 TTTTCCATTT ATTGGCATAT GCATGTCCAT AGAAGCCTTT TATAGTCCTT 
13251 TTCATTTCTA GTGTCATTTT TTTCCTTTTT TTTAAGAATC CTTAAGATTT 
13301 TAGAGATGAA ATGTCACTTT GTTACGCATA CTGGAGTGCG GTGACATTAT 
13351 TATAGCTCAC TGAAACCCAA ACTCCTGAGT TTAAGCAATC CTTCTACCTC 
13401 AAAATTCCAA AATTCCTGAG TAGCTGAGAC AGGCATACAC CATCAAGACT 
13451 GGCTAATTTA TTTCAAATTT TGTAGAGATG GGTTCTTACT AAGCTATTCT 
13501 CAATCTTTGG GCTTCAAGTG ATTCTTCAGC CTCTGTCTCT GAAAATGCTG 
13551 GGTTTATAGA TATGAGCCTC TATGCCTGAT TTGCTTTGTC TCTTTGTAAT 
13601 CTCCCATTTT ATTTGTGTCT TTTCTGGTTT GTTTCATTTT GTTATGTTTT 
13651 CAGTTACCTT GCTAAAGCTT TGTCGATTTT ATCTCTTCAA ACAACTAACT 
13701 CAATATTTTG CTGATTTTCC ATATAGTATT TTATTTCTAT TTCATTTATT 
13751 TCTGCTCTAA TCTTTGTTAA ATATCTTGTT TTCCTAATAA TTTTGAGTTT 
13801 CCTTGTTCTT GTTTTCTAAT TCCTTGCGAT GTTATCATAA ATTGTTTATT 
13851 TGATATCTTT CTACTTTTTT GATGTGTGTG TTCGTTGTTG TAGACTTTCC 
13901 TCTTTATTAT TCTGATTTCT TCCTCAATTC TCTAATATTA TGATTGCATT 
13951 ATTTTCCAAG TTTCTTTTGT TTTTTTATTT ATAGTTTATG TGATTCCTGA 
14001 ACTTGTCAAA GAGATTATTG TGAATTTGAT GTCGGATATT TAAGCATTTT 
14051 CAAAACTTTT GGTGCATTAT TGAAATTTTA TTGGTTTATT TTAGAGATGT 
14101 CATACTTCCC AGTTTTTTTT TAACAATACT TGCTCTTTAT ATTGATGTCT 
14151 ACATATTTAA AAAGATAACC ACCTGATTCA GCTTTTTAAG GTGATATGCA 
14201 GTGGTGTTAA GTGTGTACTG CTTAATATCA GAGCTGAATC ACTGCCCTGA 
14251 GGATTCTTTC TGTTCTGAGG AGAGCTTGTA GTTAATAGCA GAACCTAAAT 
14301 AGTGCAGTAG AGCTAAATCT CTTCCATGCT GTTGTTTTCC TGTCTGGGGA 
14351 AGACTTATCA TGACCATGAA AACATAATGC TGTGCCAGAA CTTAAACCCA 
14401 AACCTGTAGT AATTTCTGAG TTGAGGAAGG CTTAAGAAAT AACTGGAACT 
14451 TAGTTACTAA CCTGATAGTT GTTTCTGAGT CAGAGAAATG CTCTGCATGA 
14501 TCACCTGGGA TATTTGTAAA ATCTAACCAA AGATTCTAGC CTTCCCCTTG 
14551 GATTGTGCCT CCTGTACTAC TGTAGTGCTG GCTAGGTCCT CATCAGTGAA 
14601 TTCCCTGCTG ATAGGACCAC AAAGCATCTG CCAAGATCTG TTTGCCATTT 
14651 GCTGTGATTA GTGCTTCTGC TCTTTGCTTC CAATTCAACT CAGGTGGTTC 
14701 AGCCCTTCTG ACACTCCTAA TACCTCCTGT GGGATGGAAC ATAGAAGGCT 
14751 TCTCACAATG ATTCACACAC TGATATGGAG ATTGAATGTC CAGTTGCAAC 
14801 TATTTTCTTC CACCTGTGTA ATTGCAGGTA CAGGGAAGTT TTCTGTGACT 
14851 GATGCTATTT TGGTTTGGAG AATGGGGTGA TGTGGCACAA TGATCTTTCT 
14901 TCTTTCTGGT CATGGATTTT TTAATTTCCA TGAACCCATA AGATTTTTCA 
14951 CTTTTCTTCT GAGCTCTGGT GCTTTCAGAG TGGTATTTTT ATATTCGAAT 
15001 AGTTGCTAGT TGTACTTTTA AAAGCGATTG ATGCTGGAGG TCTTCTATTC 
15051 CACCATCTCG CTGATGTCAG TCCTCAAATA ATAATTTTAT ATTTTAGCAA 
15101 ATTATTTTGG TTTTAGGATT TTGTGTCTAC GTGACACAGA CATGAAAAGA 
15151 GATGTACTCA TTACTGAAAC TTTTTGCATA CTGTTTTGGT TGTGCGCCTT 
15201 TTCTAGTATG AATGATTACA TATTTAAGCC ACATGTTTTA TACATAGACT 
15251 GTCCTTTAAA GAGACTAGAT AGTTCTGTGT GTCAGCATAT AGGGACAGAA 
15301 TATAACTACA CATTAATAAT TTCTCAAGTA TTTATTTTAG AAGTGTAAGT 
15351 AACCTTTATT TTAATTTTTG TTATATTATG CCTCTGTAAT GCAGATAAAT 
15401 TTTTATCTTC AGGAAATGGA AAATTTTGTC CAGAGTTCAG GGGAAGATGG 
15451 TATTGTGGTG TTTTCTCTGG GGTCACTGTT TCAAAATGTT ACAGAAGAAA 
15501 AGGCTAATAT CATTGCTTCA GCCCTTGCCC AGATCCCACA GAAGGTCAGT 
15551 AAAACCTCCA ATCCTGATAA GCAGCTATTC ACATAATGAA ACAGTATGGT 
15601 TTTATTTGGG TCTTGAATCT CATTTTCCAC TTAGCATAAC AGGTACCAAA 
15651 ATTTGCAAAA CATTATAGTA GTGTACATGG GCATAACTGA TCATTTGCCT 
15701 ACTGAGTCTT GCTGTTACTG GAAACAACTT TCTTGATTGT CATTTGTTTA 
15751 TAATAAAATA GATATAATAA ATAAAGCTCT ACCTTATATT TTAGGATTTG 
15801 AAATCTAAAA GCGTGTGCCA ATGATTCCAA AAAAAAATTC TGACATCTAT 
15851 TATTTCAAAG GACCAGAAAA AGGAAAACTG ATATAAAAAA AAAAAGAAGA 
15901 ATCAATCTCA AGAATATCTT CTCATATTTG TGTGTATAAA AACTGTATTC 
15951 AGGGTAGTTT TGCTTAGAAA TAAAAGCTCA GATTAATGTA GTCTTTCTAA 
16001 ATAATTAGAA GTTTCAAAAG TAAAATGTCA ATTACAATTA TAGTATAGTA 
16051 ACAATTATTT AAGTAATGTA ATTATTTATG ATACTCCACT AATTTTAACT 
16101 TTATTATTAC TGTAATTCTA GAATTTCACA CTTTAGATAG TGCTATATAT 
16151 AAACTATCCA AAAGATATTT CATTTTATAT TTAGCTAAAA TACTTCAAAC
16201 TCAATAAAGG CAAGCATACT AATTAGGAAT TTGAAATATT GTAATTTCAA 
16251 TTATGAAATT ATCTGTTAAG TAGTTTGAAA CATCTATGCC GTTCTTTGTT 
16301 TTCAAATGTA TAAAATTTGT ATAGGTGTCC AACAAAGAAA AATTGTGTAA 
16351 AAAAAAGGTA CAATCTCAAA GAAAATTTAT CATTGAACAG TGGAACATAA
16401 GTAATTTTCT AGCTCATTCT TCTTCAATAA AACAATTAAA TATAAGAAGA 
16451 AAGAGGCCAG GAAGGAAATA GAGAAGAAAA GACACCCGAT TATCCAAAAG 
16501 ACACACATAA TTGAAAGCAA ATTTTTATCT GCAGGGAACT GTAAATTTGA 
16551 TGGTAGAATG AGATTGGCTC CATGAGTTAA AATGACACAC AGATCAGGTA
16601 CTTATAAAAT TTTTAATTCT TATATAAAAA TAGATTAGCC ACTGCTGAAT
16651 TATTTTTTTA AATATTCACT GGTATTCTCA TTCTCAAATA TTTTTAATTG
16701 GTAATAAAAT AATAATAGCA TACCTAATAG GCAACTGGTA CACATTATTT 
16751 TAAAAGATCT TTGTAAAACG TCCTACTATA TCTTTCAGTC TTTACGCGGT 
16801 AGCTCTACAC ACCCCTGTCT CAACCATCAC CTGAAGTACA ATGAGTTTAT 
16851 AATTTATAAC TATATCTACA TCCTTAGAAT GCTAATATCC TGTGGTTCAC 
16901 TCTGTGAAAT ACATGTGTTT CTTCCGTAGG TGTTATGGAG GTACAAAGGA 
16951 AAAAAACCAT CCACATTAGG AGCCAATACT CGGNNNNNNN NNNNNNNNNN 
17001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNAAAAAAA 
17051 AAGACCCAAT CCCAAAGAAA ATTTATCATA GAACAATGGA ACATAGGTAA 
17101 TTCTCTAGCT CATTCGTCTT CAATAAAACA AATAAATATA AGAAGACAAA 
17151 GGTCAGGAAG GAAATAGAGA AGAAAAGATA ACCGATTATC CAAAATCACA 
17201 CACAAAATTG AAAGCAAATT TTATCTGTGG GGAACTGTAA ATTTGATGGT 
17251 AGAACCAGAA TAGTTCCATG ATTTGAAATG ACACAGAGAT CATGTACTTA 
17301 TAAAATATTT TATTCTTATA AGAAAATTGA GTAGCCAGTG CTGAATTACT 
17351 TTTTAATGAT TCACTGATAT TCTCATACTC AGATATTTTA ATTGATATTA 
17401 AAATAATAAT AGTATACTTA ATAGTCAACT GGTACACATT ATTTGAAAGG

17451 ACTTTTGTAA AAAGTCCTAC TATGTCTTTT ACTGTTTACA CAGTACCTCT 

17501 ACATACCCCT GTCTCAACCA ACACCTGAAG TACAATGAGT TTATAATTTA 

17551 TAACTATATC TACATCCTTA GAGTGCTAAT ATCCTGTGGT TCAATCTGTG 

17601 AAATACATGT GTTTCTTCCA TAGGTGTTAT AGAGATACAA TGGAAAAAAA 

17651 ACCATCCACA TTAGGAACCA ATACTCGGCT GTATGATGGG ATACCCCAGA 

17701 ATGATCTTCT TGGTAGGTCT ATGAGAAAGT AAAAATATGA ACTAGACGAG 

17751 GAAAAAATGA ATAAATGTTA AACAGCAAGC AAATTCAGCA AAGATCTAAA 

17801 ATTATAAAAC TTTATTTTAC TTACTCTTTT GAAGCAGATA TAATTAAAGG 

17851 ATTGACTAAA ATTGTATAGA TTCACACTTT CTATTGTTAA GGTGAGAGTG 

17901 ACAGGAAATT CAGAAGGAAT TAATGCCTAT TTTTCTGGAG ATAGAAATGA 

17951 TCTTTAGTAG CAATGCTCCA TGTGCTCACC TTCTAAAGAA AGTGCTGTAC 

18001 GCTTCAGTGA GTTATCTCGT AATTCCCATC TGTAGTTTTT AAATAATTTT 

18051 AAAAGTTTAG AATAAAATAT CTCACCATTT CTCATCCAAT TTACATACTA 

18101 GGTCATCCCA AAACCAAAGC TTTTATCACT CATGGTGGAA TGAATGGGAT 

18151 CTATGAAGCT ATTTACCATG GGGTCCCTAT GGTGGGAGTT CCCATATTTG 

18201 GTGATCAGCT TGATAACATA GCTCACATGA AGGCCAAAGG AGCAGCTGTA 

18251 GAAATAAACT TCAAAACTAT GACAAGCGAA GATTTACTGA GGGCTTTGAG 

18301 AACAGTCATT ACCGATTCCT CGTAAGTACT ACTGCTTGTA CAGACTGATC 

18351 TAACATTGAC TATGTTATAC ATTATACCAG AAAATGTTAA ATATCATCCT 

18401 GGTAGACATG TTGAGGGATT TTACTCCACA ATATTGAGTC ATTCATCACC 

18451 TTGTTACTGG AATAGTTGTG GAAATTGTAG TTCATAGAGT GTCAAACTTT 

18501 CTTCATGGAA ATATTAGGTT TAAGTTAACA ACTGGCTTAC TAAGCTTTTA 

18551 TTCACATCTT AATTTTACCC CATTTTGTTA AGAATATACT CTTTCAGTCT 

18601 CTCCACTATA TCTGTTTAAT ACTATGTAAC CAACAATATT CATGTCACAA 

18651 CCAGAATCAA TCTTTTACTG AACATGTTCT TGGCTTGCAT AACATATACT 

18701 ACGGTTTATC TACGTGTCTT TTATGAAAAC AAAACTACAA CTTTCTAAGT 

18751 TCTATGTGTG TTTTTCCCTT CCAGTTATAA AGAGAATGCT ATGAGATTAT 

18801 CAAGAATTCA CCATGATCAA CCTGTAAAGC CCCTAGATCG AGCAGTCTTC 

18851 TGGATCGAGT TTGTCATGCG CCACAAAGGA GCCAAGCACC TGCGATCAGC 

18901 TGCCCATGAC CTCACCTGGT TCCAGCACTA CTCTATAGAT GTGATTGGGT 

18951 TCCTGCTGAC CTGTGTGGCA ACTGCTATAT TCTTGTTCAC AAAATGTTTT 

19001 TTATTTTCCT GTCAAAAATT TAATAAAACT AGAAAGATAG AAAAGAGGGA 

19051 ATAGATCTTT CCAAATTCAA GAAAGACCTG ATGGGGTAAT CCTGTTAATT 

19101 CCAGCCACAT AGAATTTGGT GAAAACCTTG CTATTTTCAT ATTATCTATT 

19151 CTGTTATTTT ATCTTAGCTA TATAGCCTAG AATTCCACGA TCATGAGGTT 

19201 GTGAGTATAT CTCATTCTTT CGTTGTATTT TCCTAGGTGT CTTTACTCTC 

19251 TTCTCTCACT TTGTGACACA AGGACATGAA TACATCTAAA TTTTCCTATT 

19301 TCTGATATGA CTGTTTTGAT GATGTCATTA CTTCTATAAC CTTAAGTGAT 

19351 AGGGTGACAT GCAATATGAT TATTCCTGGT GTGCGCCCAA ACACATGGAT 

19401 ATAAAGAGGT AAAAAACTTA AAATTCACAA AATTCAGTAA ACCACACAAA 

19451 TCAGGTAAGT GTTCTATGAG ATTAGCTGGC TATGAGAAAC ATAATGATGT 

19501 TTCTTTTTCA ATTTAAATAA GCCCTTCTAC ATAGCCAGCA TCAGTGATCT 

19551 CAGAAAATAA ATTGCTAATA ATGATGACAT GGCATTATGC TTAGAAAAGT 

19601 TTGCTGTATT TCCATAGACC TCATCTAGAT GTCATGGCCT ACATTTCTGC 

19651 CATCACTCAA CCAATACTTT TTTCTGTTTT CTTGATGATA AAAAGACCTT 

19701 TCTCATGATT GCCATCAAAT AACAAAAGAA ACTATTTTTT TTCTCACATA 

19751 GAGAACATGT CAGTAAGATA TTCAAGGTGA ACAGATATTT TTGGGATTAG 

19801 TAACTATTTG AAATATGTGG TGATAATTAC TGAGTTTATA AAATTTATTT 

19851 GATAGTACAC TTAAAGAAGA TTTATATGTT TATTCTTTAA AAATGATGAA 

19901 TACTCATAAT TCTTATCTCT ATAATCAAAA GTATAATTTA CTGTAGAAAA 

19951 ATAAAGAGAT GCTTGTTCTG AAAGTAAGAT CAGTGAACTG CTTTTCAGTC 

20001 TCAATCTTTG AGAATTGTAA ATTCATCAAA TAATTGCTTA CATAGTAAAA 

20051 ATTTAAGGTA TTAGAAAACC TGCATAACAA ATAGTATTAT ATATTAAATA 

20101 TTTTGATATG TAAAGCTCTA CACAAAGCTA AATATAGTGT AATAATGTTT 

20151 ACACTAGTAA GCAAATATGT TAATCTTCTC ATTTTTTTAC TGTCATATAA 

20201 TCTTAGTGAT ATGCCTATTA ATAGTTTTAA ATAAATAAAT TGGCTTATCT 

20251 GGCTTTTTGA AAATTTTGAA ATTCTTACAG ATGTTGATTA GGTATATCTA 
20301 CAAATTAATT TCAATTTTAA AATGATGATA TAAAAATAAA TATAAGTATT

20351 TTTCTTGTGT ATGTATACAA TAAATATAAA TAAAATTGTT TACTGTTTTG 

20401 AAAGTTTCTT AAGTTTTTAC ACTGATATGT TTTTTGACTT TTACAATATT 

20451 ATTATAATCT AGGAAAAGCT GATTATATCT GTTTTAAGCC TCATCTTTTC 

20501 TCTGTAATTA AACACAGTAA TTTATTAACA TGCTGTGACA GGTGGGAAGC 

20551 CATTTCTGGA GTTGAGCCTG CTGACACTCT GGAGCTTTTT AGGTTGGACG 

20601 TTCATTGTAT GTGGGACTCT CTGCCTCTCG ATAGCTGTTG CTCATAAGAC 

20651 TCTCCTTCAT CAATCTGGCA TTGAATTTTG CGATCAGTTG CAATCAGAAT 

20701 CCAATTGGCC TTGCCGTTTT AGTATGTTCT ATCTTAACCA GCAATTTCTA 

20751 ACCAGGAGCC TGCCCAGGTT TGTTCTGTCT TCCCTGTAAG AAGCTCCCAG 

20801 CATAAATATT CTAAATTTTA CACTACTAAT CTATTAACCA ACCTTTGGAC

20851 CATGTTCACT TTAGGTTGAG CATAGTGTGA TGAGATGCAA ATTAAATTAC 

20901 AATCCTATAG GTGTGTGTTA TAAATTTTAA AGTGTATAAA TTAAATAACA 

20951 CATTCTAAGT ATCCAACAAA GGTCAAAAAA ATGATATAAA GTCACCAAAC (SEQ ID NO: 3) 
Context:



DNA 

Position 
1735 TACTTTACTGATATACACCTATTTGACTTATTTTTATTTCTTTGCTTTACTGATAAAAAG
TTGTTTTGCTTTGCAATTTTCATATAGTTGTGATCAGAGCTGGTCAATGCAAGACATGTT
TTTATCCAAATATGTTTGAGAATTATGTAGAAACATGAAAAAAGGTACAATTATATCCGA 
CACTAAAATATTGTTTAATGTATTCCAACGAATTCTTATGCATAGACTGTTTCACAGAAC 
TAATATTCAGAGGATCCCAGTTCAAATGTCCTTAGCCTTAGACATGATTTGAATTTACAT 
[A, G] 

TATTGATTTGCTTTAAATAATTTTCCATTCAGTAAGCTGTGCCTAGCTGCAGATAGCCTA 
CCAGGCTTTATGGATCTAGGTAAACAATACAAATCTCTTGGCCTCAAGTCTACATTCAGA 
TATTAATTTAAAGGGGTACAGCTATATAGAGGTCACTGGCAAATTTTGGTAAAATAGGAT 
TATAGTAAAAGCCCCCTGACAAGATTGAAATTTAAAATAAAACAAAAGTGTTATCAAAGG 
GGTGAAAGAGCATTTTCCAATAAACAAAAGTGGGTTCTGGCCATGCATTCAGAAATTCCC 



1922 ATATTGTTTAATGTATTCCAACGAATTCTTATGCATAGACTGTTTCACAGAACTAATATT 
CAGAGGATCCCAGTTCAAATGTCCTTAGCCTTAGACATGATTTGAATTTACATATATTGA 
TTTGCTTTAAATAATTTTCCATTCAGTAAGCTGTGCCTAGCTGCAGATAGCCTACCAGGC 
TTTATGGATCTAGGTAAACAATACAAATCTCTTGGCCTCAAGTCTACATTCAGATATTAA
TTTAAAGGGGTACAGCTATATAGAGGTCACTGGCAAATTTTGGTAAAATAGGATTATAGT 
[A,G]

AAAGCCCCCTGACAAGATTGAAATTTAAAATAAAACAAAAGTGTTATCAAAGGGGTGAAA 
GAGCATTTTCCAATAAACAAAAGTGGGTTCTGGCCATGCATTCAGAAATTCCCCAACAAT 
TCTTTAAAAATCATGGAGCAGCTTGATATATAAGAAATTCATTTAATAACTATATTTATT 
ATGTAGCTCCAACTTACTAAATTATTGATTATTATATATTTTATAGAATTATCTATTGTG
AGTCTAAATCAAGAGTATATATTCAAACAACTATAGGAAAAGGGATATCAGTCAATTTCA 

2361 CAGCTTGATATATAAGAAATTCATTTAATAACTATATTTATTATGTAGCTCCAACTTACT
AAATTATTGATTATTATATATTTTATAGAATTATCTATTGTGAGTCTAAATCAAGAGTAT 
ATATTCAAACAACTATAGGAAAAGGGATATCAGTCAATTTCAATTCAAGGATTTATTTCC 
ATAAGTGCTTACGCACAGGTGTATTTCATTTTATTATACATTGCTTTATTGTCCTTCACA
AAAATTGCAATTTACAAATTAAAGGTTTTTGAAAACCTTGAATCAAGCTAATCAATTTGG 
[C,T] 

GTAATATTTCCAACAACAAGTGTGTACTTTTGACTCTATCACATATTGGCATTTATCATG 
CTTTTTCAAATTTTTCATTGTTATATCTGTTACGGTGATCTGGGATCAGTGTTCCTTGAT 
GGTTACACGTTTATTAGCTTGGGGGCACCTTGATGTGTTACAATATAAGACAGCAAACTT 
AATTATAAATGTTGTGCATGTACTAACTGCTCCGCTGATTCGTTTCCCCATCCCACTTCT 
TCTTAGGCCTCCCTATTCCCTGAGACACAGTAATATAACATACAATGACTTCTAAATGTT 

7371 AAATAAGTGATGAAGGAAGACAAAGAATAAAAGAAAATTTCAATAAATGGAAAGTTTAAG
TGTTTAATGATAGTGATGACTTTTACTCAAATAAGTGCTTAGAAGTCATCTTGTTTGTGA 
TTTATATGATGAATTCTGTGTTGTGACTATCCACTTTGAGCTCGTGAGAATGTTAGGTGA 
GGTTTAATAAAAGCCATTTGAGAAAAACAAGGTTTCAACCTCTGTGGACAGAAATCTAAA 
TATCGATAGTTATCAGGACAAAGTAGAGCTCATAGAAATAATTTTGCAGCCTGCAGGTTT 
[G,C] 

TTTTGGAGTGAAAATAAAATTGTATACTATATTCCTAAATCATCAGAGGAAAAAATTTAT 
AGTTCAAGGAATGTTGAAAGAAACAATATTGAGAAGTAAAAGTGAGTAATAGTTGTTATA 
GTTTTTTAATAGTTTTGTAAGTATGTCTTGAGTTCACTGTCCCAAAAGTGGCTATTAGCT 
CTAGCCTTGACCTGACAAGGTTCTAGGATATTTAGTCATGGATGTTCATAATCTACCTCT 
TACGGGATACTTTTTATTCTGATGAACAGCCTAATGCCTAAGTGTGCAATCTATACCAAG 



9558 TCCACATATAAGGGATACCATGCTATCTCTGCCTGGATTATTTCAGTTAACATTATGCCC 
TGGAGGTTCATTCATGTTTCTACAAATGACAGGATTTCATTCTTTTTTTTCCAATATATA 
TTTAATGAAATGGATATATATAAACATTGGAAAATGTATATATATATATATATCTCCAGT 
GGAATGCTATTGAGCTATAAAAAAGTTAATATATAATAGAAATAAAGCTTATATATATCT 
AATGGAATGGATATATATATATAATGGAATAGAAATATATATCTATACATATAAACACAC 
[G, A] 
CAATATACATATCCATTTCATTGCATATATATATATATAGAGAGAGAGAGAGAGAGATAT
TTTCAAATGTGTGTATATATATCCAATGGAATGGACATATATATATGTATATTTTTTCCA 
TATTTTCTTTATGTATTTCTTCATTAATGGATGTTTAGGTTGATTCATCCCTTGGGTATA 
TGAATAATGTTGATGTAAACATAGAAGGACAGATATCTCTATGACTTCTTAGTTTATTTA 
AATATACACCCAGTAATGGAAATGCTGTATAATATGGTAGTTCTATTTTCATTTTTTGAG 

10579 CAATGCTTTGATATAACCCATTTATCTACTTTTCCTTTTGTTGGCTGTGCTTCTGAAGTC
CTATCCAAAAAAATCCTTGCCTAGACCAATGTCACAAATCATTCCTCCTACAGTTTCTTC 
TAGTAGTTGTATAATGTTTGGCCTTATATTTAACTTTGTAATTCATTTTTACTTACTTTG 
TATATGGTGAGGGATAGAGGTCTAGTTTCATTTTCTGCATGTGGATATGCAGTTTTCCTA 
GCACCATTTAGTGAAGAGGTTGCCTTTTTTCTATTATGTGTTCTTGGCACCTTTGTCAAA 
[T,G,A] 

GTCAGTTAGCTGCTATATTCCTCCATTTGTGTTGTTATAGAGGAACACATGAGACTAGCA 
AATTTATATATCAAATAGAATTATTTGAATGATAGTTCTGCATACTGTACAAGAAGCACA 
GCACTGACTTCTGCTTGGCCTCTGGTAAGGTTCTCAAGATGCTTCCACTTGTGGTAGAAG 
GCAAACATGAGCTGGTATATGCAAAGGTCTCATGACAAGAGAGGAAACCATAAAGAGGGG 
ATGTGAGGGAGTGCCAGGTTTTGTAAAACAACTAGCTCTTCTGGGAACTAATAGAGTAAA 

10625 GTGCTTCTGAAGTCCTATCCAAAAAAATCCTTGCCTAGACCAATGTCACAAATCATTCCT 
CCTACAGTTTCTTCTAGTAGTTGTATAATGTTTGGCCTTATATTTAACTTTGTAATTCAT
TTTTACTTACTTTGTATATGGTGAGGGATAGAGGTCTAGTTTCATTTTCTGCATGTGGAT 
ATGCAGTTTTCCTAGCACCATTTAGTGAAGAGGTTGCCTTTTTTCTATTATGTGTTCTTG 
GCACCTTTGTCAAAAGTCAGTTAGCTGCTATATTCCTCCATTTGTGTTGTTATAGAGGAA 
[C,T] 

ACATGAGACTAGCAAATTTATATATCAAATAGAATTATTTGAATGATAGTTCTGCATACT 
GTACAAGAAGCACAGCACTGACTTCTGCTTGGCCTCTGGTAAGGTTCTCAAGATGCTTCC 
ACTTGTGGTAGAAGGCAAACATGAGCTGGTATATGCAAAGGTCTCATGACAAGAGAGGAA
ACCATAAAGAGGGGATGTGAGGGAGTGCCAGGTTTTGTAAAACAACTAGCTCTTCTGGGA 
ACTAATAGAGTAAAAATTCGCCTCCCAGGCAGGGGATTAATCTATTCATGAGGGATCTGC 

11147 ACAACTAGCTCTTCTGGGAACTAATAGAGTAAAAATTCGCCTCCCAGGCAGGGGATTAAT
CTATTCATGAGGGATCTGCTTCCATGACAAAGGCACATTCTGTTAGATTCTACCCCCAAT 
ATTGGGGATCAAATTTTAACATGAAGTGTGGAGGGCTCAAATATCCATACTATGGCAGCA 
GTAAATGCATAAATTTATTTTGTGGATCTCTATTCTATATAGTATTGGTGTATGTATCTG 
TTTTCATGCCACTGCCATACTGTTTTGGTGATGATATCTATGCTATATATGTGTGTGTGT 
[A, G] 

TATATATATTATATATATGTATATATGTGTATATTATATATATGTATATATGTGTATATT 
ATATATATATAATACTTTAAGTTTTATATATATATAAAATACTTTAAGTTCAAGGGTACA 
TGTGCAGGATGTGCAGGTCAGTTACATAGGTATACATGTGCCATTTTGGTTTGCTGCATG 
CATCAACTCATCATTACATTAGGTATTTCTCCTAATGCTATCCCTCCACCAGCCACCCAA 
CCCCCAACAGGCCAGGTGTGTGATGTTCCCCGCCCTGTGTCCATGTGTTCTCATTGTTCA 

15131 CAGGGAAGTTTTCTGTGACTGATGCTATTTTGGTTTGGAGAATGGGGTGATGTGGCACAA 
TGATCTTTCTTCTTTCTGGTCATGGATTTTTTAATTTCCATGAACCCATAAGATTTTTCA 
CTTTTCTTCTGAGCTCTGGTGCTTTCAGAGTGGTATTTTTATATTCGAATAGTTGCTAGT 
TGTACTTTTAAAAGCGATTGATGCTGGAGGTCTTCTATTCCACCATCTCGCTGATGTCAG 
TCCTCAAATAATAATTTTATATTTTAGCAAATTATTTTGGTTTTAGGATTTTGTGTCTAC 
[C,T,G] 

TGACACAGACATGAAAAGAGATGTACTCATTACTGAAACTTTTTGCATACTGTTTTGGTT 
GTGCGCCTTTTCTAGTATGAATGATTACATATTTAAGCCACATGTTTTATACATAGACTG 
TCCTTTAAAGAGACTAGATAGTTCTGTGTGTCAGCATATAGGGACAGAATATAACTACAC
ATTAATAATTTCTCAAGTATTTATTTTAGAAGTGTAAGTAACCTTTATTTTAATTTTTGT 
TATATTATGCCTCTGTAATGCAGATAAATTTTTATCTTCAGGAAATGGAAAATTTTGTCC 

15221 TTAATTTCCATGAACCCATAAGATTTTTCACTTTTCTTCTGAGCTCTGGTGCTTTCAGAG 
TGGTATTTTTATATTCGAATAGTTGCTAGTTGTACTTTTAAAAGCGATTGATGCTGGAGG 
TCTTCTATTCCACCATCTCGCTGATGTCAGTCCTCAAATAATAATTTTATATTTTAGCAA 
ATTATTTTGGTTTTAGGATTTTGTGTCTACGTGACACAGACATGAAAAGAGATGTACTCA 
TTACTGAAACTTTTTGCATACTGTTTTGGTTGTGCGCCTTTTCTAGTATGAATGATTACA
[A,G,T] 

ATTTAAGCCACATGTTTTATACATAGACTGTCCTTTAAAGAGACTAGATAGTTCTGTGTG 
TCAGCATATAGGGACAGAATATAACTACACATTAATAATTTCTCAAGTATTTATTTTAGA 
AGTGTAAGTAACCTTTATTTTAATTTTTGTTATATTATGCCTCTGTAATGCAGATAAATT 
TTTATCTTCAGGAAATGGAAAATTTTGTCCAGAGTTCAGGGGAAGATGGTATTGTGGTGT 
TTTCTCTGGGGTCACTGTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAG

15778 GTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAGCCCTTGCCCAGATCCC
ACAGAAGGTCAGTAAAACCTCCAATCCTGATAAGCAGCTATTCACATAATGAAACAGTAT 
GGTTTTATTTGGGTCTTGAATCTCATTTTCCACTTAGCATAACAGGTACCAAAATTTGCA 
AAACATTATAGTAGTGTACATGGGCATAACTGATCATTTGCCTACTGAGTCTTGCTGTTA 
CTGGAAACAACTTTCTTGATTGTCATTTGTTTATAATAAAATAGATATAATAAATAAAGC 
[T,C]

CTACCTTATATTTTAGGATTTGAAATCTAAAAGCGTGTGCCAATGATTCCAAAAAAAAAT 
TCTGACATCTATTATTTCAAAGGACCAGAAAAAGGAAAACTGATATAAAAAAAAAAAGAA 
GAATCAATCTCAAGAATATCTTCTCATATTTGTGTGTATAAAAACTGTATTCAGGGTAGT 
TTTGCTTAGAAATAAAAGCTCAGATTAATGTAGTCTTTCTAAATAATTAGAAGTTTCAAA 
AGTAAAATGTCAATTACAATTATAGTATAGTAACAATTATTTAAGTAATGTAATTATTTA 

15895 TATGGTTTTATTTGGGTCTTGAATCTCATTTTCCACTTAGCATAACAGGTACCAAAATTT 
GCAAAACATTATAGTAGTGTACATGGGCATAACTGATCATTTGCCTACTGAGTCTTGCTG 
TTACTGGAAACAACTTTCTTGATTGTCATTTGTTTATAATAAAATAGATATAATAAATAA 
AGCTCTACCTTATATTTTAGGATTTGAAATCTAAAAGCGTGTGCCAATGATTCCAAAAAA 
AAATTCTGACATCTATTATTTCAAAGGACCAGAAAAAGGAAAACTGATATAAAAAAAAAA
[-,A] 

GAAGAATCAATCTCAAGAATATCTTCTCATATTTGTGTGTATAAAAACTGTATTCAGGGT 
AGTTTTGCTTAGAAATAAAAGCTCAGATTAATGTAGTCTTTCTAAATAATTAGAAGTTTC 
AAAAGTAAAATGTCAATTACAATTATAGTATAGTAACAATTATTTAAGTAATGTAATTAT 
TTATGATACTCCACTAATTTTAACTTTATTATTACTGTAATTCTAGAATTTCACACTTTA
GATAGTGCTATATATAAACTATCCAAAAGATATTTCATTTTATATTTAGCTAAAATACTT

19786 GAAACATAATGATGTTTCTTTTTCAATTTAAATAAGCCCTTCTACATAGCCAGCATCAGT 
GATCTCAGAAAATAAATTGCTAATAATGATGACATGGCATTATGCTTAGAAAAGTTTGCT 
GTATTTCCATAGACCTCATCTAGATGTCATGGCCTACATTTCTGCCATCACTCAACCAAT 
ACTTTTTTCTGTTTTCTTGATGATAAAAAGACCTTTCTCATGATTGCCATCAAATAACAA 
AAGAAACTATTTTTTTTCTCACATAGAGAACATGTCAGTAAGATATTCAAGGTGAACAGA 
[-,T] 

ATTTTTGGGATTAGTAACTATTTGAAATATGTGGTGATAATTACTGAGTTTATAAAATTT 
ATTTGATAGTACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCA 
TAATTCTTATCTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGT 
TCTGAAAGTAAGATCAGTGAACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCAT
CAAATAATTGCTTACATAGTAAAAATTTAAGGTATTAGAAAACCTGCATAACAAATAGTA 

20157 ACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCATAATTCTTAT 
CTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGTTCTGAAAGTA 
AGATCAGTGAACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCATCAAATAATTG 
CTTACATAGTAAAAATTTAAGGTATTAGAAAACCTGCATAACAAATAGTATTATATATTA 
AATATTTTGATATGTAAAGCTCTACACAAAGCTAAATATAGTGTAATAATGTTTACACTA 
[G,A] 

TAAGCAAATATGTTAATCTTCTCATTTTTTTACTGTCATATAATCTTAGTGATATGCCTA 
TTAATAGTTTTAAATAAATAAATTGGCTTATCTGGCTTTTTGAAAATTTTGAAATTCTTA 
CAGATGTTGATTAGGTATATCTACAAATTAATTTCAATTTTAAAATGATGATATAAAAAT 
AAATATAAGTATTTTTCTTGTGTATGTATACAATAAATATAAATAAAATTGTTTACTGTT 
TTGAAAGTTTCTTAAGTTTTTACACTGATATGTTTTTTGACTTTTACAATATTATTATAA 

20246 GAAAAATAAAGAGATGCTTGTTCTGAAAGTAAGATCAGTGAACTGCTTTTCAGTCTCAAT 
CTTTGAGAATTGTAAATTCATCAAATAATTGCTTACATAGTAAAAATTTAAGGTATTAGA 
AAACCTGCATAACAAATAGTATTATATATTAAATATTTTGATATGTAAAGCTCTACACAA
AGCTAAATATAGTGTAATAATGTTTACACTAGTAAGCAAATATGTTAATCTTCTCATTTT
TTTACTGTCATATAATCTTAGTGATATGCCTATTAATAGTTTTAAATAAATAAATTGGCT
[T,C] 

ATCTGGCTTTTTGAAAATTTTGAAATTCTTACAGATGTTGATTAGGTATATCTACAAATT 
AATTTCAATTTTAAAATGATGATATAAAAATAAATATAAGTATTTTTCTTGTGTATGTAT 
ACAATAAATATAAATAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTTACACTGAT
ATGTTTTTTGACTTTTACAATATTATTATAATCTAGGAAAAGCTGATTATATCTGTTTTA 
AGCCTCATCTTTTCTCTGTAATTAAACACAGTAATTTATTAACATGCTGTGACAGGTGGG 

20681 TAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTTACACTGATATGTTTTTTGACTT 
TTACAATATTATTATAATCTAGGAAAAGCTGATTATATCTGTTTTAAGCCTCATCTTTTC 
TCTGTAATTAAACACAGTAATTTATTAACATGCTGTGACAGGTGGGAAGCCATTTCTGGA 
GTTGAGCCTGCTGACACTCTGGAGCTTTTTAGGTTGGACGTTCATTGTATGTGGGACTCT
CTGCCTCTCGATAGCTGTTGCTCATAAGACTCTCCTTCATCAATCTGGCATTGAATTTTG 
[C,A] 

GATCAGTTGCAATCAGAATCCAATTGGCCTTGCCGTTTTAGTATGTTCTATCTTAACCAG 
CAATTTCTAACCAGGAGCCTGCCCAGGTTTGTTCTGTCTTCCCTGTAAGAAGCTCCCAGC 
ATAAATATTCTAAATTTTACACTACTAATCTATTAACCAACCTTTGGACCATGTTCACTT 
TAGGTTGAGCATAGTGTGATGAGATGCAAATTAAATTACAATCCTATAGGTGTGTGTTAT 
AAATTTTAAAGTGTATAAATTAAATAACACATTCTAAGTATCCAACAAAGGTCAAAAAAA 



20819 AATTTATTAACATGCTGTGACAGGTGGGAAGCCATTTCTGGAGTTGAGCCTGCTGACACT 
CTGGAGCTTTTTAGGTTGGACGTTCATTGTATGTGGGACTCTCTGCCTCTCGATAGCTGT 
TGCTCATAAGACTCTCCTTCATCAATCTGGCATTGAATTTTGCGATCAGTTGCAATCAGA 
ATCCAATTGGCCTTGCCGTTTTAGTATGTTCTATCTTAACCAGCAATTTCTAACCAGGAG 
CCTGCCCAGGTTTGTTCTGTCTTCCCTGTAAGAAGCTCCCAGCATAAATATTCTAAATTT 
[T,C] 

ACACTACTAATCTATTAACCAACCTTTGGACCATGTTCACTTTAGGTTGAGCATAGTGTG 
ATGAGATGCAAATTAAATTACAATCCTATAGGTGTGTGTTATAAATTTTAAAGTGTATAA 
ATTAAATAACACATTCTAAGTATCCAACAAAGGTCAAAAAAATGATATAAAGTCACCAAA 
C 